Noise-driven Temporal Trajectory Filtering of Spectral Parameters for Robust Speech Recognition

نویسندگان

  • Chandra Kant Raut
  • Takuya Nishimoto
  • Shigeki Sagayama
چکیده

Spectral parameter filtering such as RASTA [5], RASTA-like bandpass filtering and high-pass filtering operates on temporal dynamics of spectral parameters, and has been effective method to reduce channel distortions. Temporal derivatives (delta and delta-delta coefficients, that have proved as robust representation) and spectral mean normalization [4] are also equivalent to filtering that reject lower modulation frequencies from spectral parameters. Spectral normalization and parameter filtering assume that channel distortions are linear and additive noise is negligible [2]. As the additive noise and channel distortions are additive in different domains, they cannot be simultaneously suppressed by these methods [2]. In this paper, we extend the filtering technique of temporal trajectories to handle additive noise. The paper investigates the possibility of applying ‘spectral subtraction’ [3] filter to time-trajectories of spectral parameters and the problems that may arise.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Robust Speech Recognition Features Based on Temporal Trajectory Filtering and Non-Uniform Spectral Compression

This paper proposes a new feature extraction method based on temporal trajectory filtering and nonuniform spectral compression and examines its performance with two tasks in noisy environments. Temporal trajectory filtering is effective for robust speech recognition in noisy environments, due to human hearing is more sensitive to relative values rather than absolute values and the effect of add...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

A new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering

In this paper, a new feature extraction front-end for robust speech recognition using progressive histogram equalization and multi-eigenvector temporal filtering is proposed. The progressive histogram equalization (PHEQ) performs the histogram equalization (HEQ) progressively with respect to a reference interval which moves with the present frame to be processed. The multi-eigenvector temporal ...

متن کامل

Robust Speech Recognition for Adverse Environments

As the state-of-the-art speech recognizers can achieve a very high recognition rate for clean speech, the recognition performance generally degrades drastically under noisy environments. Noise-robust speech recognition has become an important task for speech recognition in adverse environments. Recent research on noise-robust speech recognition mostly focused on two directions: (1) removing the...

متن کامل

Signal Trajectory Based Noise Compensation for Robust Speech Recognition

This paper presents a novel signal trajectory based noise compensation algorithm for robust speech recognition. Its performance is evaluated on the Aurora 2 database. The algorithm consists of two processing stages: 1) noise spectrum is estimated using trajectory autosegmentation and clustering, so that spectral subtraction can be performed to roughly estimate the clean speech trajectories; 2) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004